Speech Emotion Recognition Using Multihead Attention in Both Time and Feature Dimensions

نویسندگان

چکیده

To enhance the emotion feature and improve performance of speech recognition, an attention mechanism is employed to recognize important information in both time dimensions. In dimension, multi-heads modified with last state long short-term memory (LSTM)'s output match accumulation characteristic LSTM. scaled dot-product replaced additive that refers method update LSTM construct attention. This means a nonlinear change replaces linear mapping classical Experiments on IEMOCAP datasets demonstrate could emotional recognition.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms

One of the important issues in speech emotion recognizing is selecting of appropriate feature sets in order to improve the detection rate and classification accuracy. In last studies researchers tried to select the appropriate features for classification by using the selecting and reducing the space of features methods, such as the Fisher and PCA. In this research, a hybrid evolutionary algorit...

متن کامل

Speech Emotion Recognition Using Scalogram Based Deep Structure

Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...

متن کامل

Emotion Recognition from Speech Using IG-Based Feature Compensation

This paper presents an approach to feature compensation for emotion recognition from speech signals. In this approach, the intonation groups (IGs) of the input speech signals are extracted first. The speech features in each selected intonation group are then extracted. With the assumption of linear mapping between feature spaces in different emotional states, a feature compensation approach is ...

متن کامل

SVM Scheme for Speech Emotion Recognition using MFCC Feature

Emotion recognition from speech has developed as a recent research area in Human–Computer Interaction. The objective of this paper is to use a 3-stage Support Vector Machine classifier to classify seven different emotions present in the Berlin Emotional Database. For the purpose of classification, MFCC features from all the 535 files present in the database are extracted. Nine statistical measu...

متن کامل

Feature Optimization of Speech Emotion Recognition

Speech emotion is divided into four categories, Fear, Happy, Neutral and Surprise in this paper. Traditional features and their statistics are generally applied to recognize speech emotion. In order to quantify each feature’s contribution to emotion recognition, a method based on the Back Propagation (BP) neural network is adopted. Then we can obtain the optimal subset of the features. What’s m...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEICE Transactions on Information and Systems

سال: 2023

ISSN: ['0916-8532', '1745-1361']

DOI: https://doi.org/10.1587/transinf.2022edl8084